Overview

Dataset statistics

Number of variables: 21
Number of observations: 9879
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 MiB
Average record size in memory: 168.0 B

Variable types

NUM: 10
CAT: 7
BOOL: 4

Reproduction

Analysis started: 2020-06-30 23:28:45.453512
Analysis finished: 2020-06-30 23:29:01.210819
Duration: 15.76 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

blueCSPerMin is highly correlated with blueTotalMinionsKilled (High correlation)
blueTotalMinionsKilled is highly correlated with blueCSPerMin (High correlation)
blueGoldPerMin is highly correlated with blueTotalGold (High correlation)
blueTotalGold is highly correlated with blueGoldPerMin (High correlation)
gameId has unique values (Unique)
blueWardsDestroyed has 745 (7.5%) zeros (Zeros)
blueAssists has 217 (2.2%) zeros (Zeros)
blueTowersDestroyed has 9415 (95.3%) zeros (Zeros)

Variables

gameId
Real number (ℝ≥0)

UNIQUE

Distinct count: 9879
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 4500084044.8455305
Minimum: 4295358071
Maximum: 4527990640
Zeros: 0
Zeros (%): 0.0%
Memory size: 77.2 KiB

Quantile statistics

Minimum: 4295358071
5-th percentile: 4448162119
Q1: 4483301169
median: 4510920346
Q3: 4521733208
95-th percentile: 4526401226
Maximum: 4527990640
Range: 232632569
Interquartile range (IQR): 38432039.5

Descriptive statistics

Standard deviation: 27573278.49
Coefficient of variation (CV): 0.006127280783
Kurtosis: 3.334606538
Mean: 4500084045
Median Absolute Deviation (MAD): 13393261
Skewness: -1.459122438
Sum: 4.445633028e+13
Variance: 7.602856867e+14

Histogram with fixed size bins (bins=10)

Common values

Value                 Count   Frequency (%)
4458383359            1       < 0.1%
4492870986            1       < 0.1%
4447992971            1       < 0.1%
4517889362            1       < 0.1%
4524077612            1       < 0.1%
4491128143            1       < 0.1%
4492253992            1       < 0.1%
4527078434            1       < 0.1%
4524068171            1       < 0.1%
4466040137            1       < 0.1%
Other values (9869)   9869    99.9%

Minimum 5 values

Value                 Count   Frequency (%)
4295358071            1       < 0.1%
4296004784            1       < 0.1%
4296036692            1       < 0.1%
4296354535            1       < 0.1%
4297209068            1       < 0.1%

Maximum 5 values

Value                 Count   Frequency (%)
4527990640            1       < 0.1%
4527960459            1       < 0.1%
4527909697            1       < 0.1%
4527908858            1       < 0.1%
4527898486            1       < 0.1%

blueWins
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 77.2 KiB

Value                 Count   Frequency (%)
No                    4949    50.1%
Yes                   4930    49.9%

blueWardsPlaced
Real number (ℝ≥0)

Distinct count: 147
Unique (%): 1.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22.28828828828829
Minimum: 5
Maximum: 250
Zeros: 0
Zeros (%): 0.0%
Memory size: 77.2 KiB

Quantile statistics

Minimum: 5
5-th percentile: 12
Q1: 14
median: 16
Q3: 20
95-th percentile: 53
Maximum: 250
Range: 245
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 18.01917652
Coefficient of variation (CV): 0.8084594152
Kurtosis: 23.43945163
Mean: 22.28828829
Median Absolute Deviation (MAD): 2
Skewness: 4.136352605
Sum: 220186
Variance: 324.6907223

Histogram with fixed size bins (bins=10)

Common values

Value                 Count   Frequency (%)
16                    1255    12.7%
15                    1217    12.3%
17                    988     10.0%
14                    974     9.9%
18                    831     8.4%
13                    694     7.0%
19                    483     4.9%
12                    447     4.5%
20                    288     2.9%
11                    239     2.4%
Other values (137)    2463    24.9%

Minimum 5 values

Value                 Count   Frequency (%)
5                     2       < 0.1%
7                     1       < 0.1%
8                     16      0.2%
9                     39      0.4%
10                    96      1.0%

Maximum 5 values

Value                 Count   Frequency (%)
250                   1       < 0.1%
221                   1       < 0.1%
209                   1       < 0.1%
203                   1       < 0.1%
198                   1       < 0.1%

blueWardsDestroyed
Real number (ℝ≥0)

ZEROS

Distinct count: 27
Unique (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.824881060836117
Minimum: 0
Maximum: 27
Zeros: 745
Zeros (%): 7.5%
Memory size: 77.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
median: 3
Q3: 4
95-th percentile: 6
Maximum: 27
Range: 27
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 2.174998382
Coefficient of variation (CV): 0.7699433482
Kurtosis: 17.19675844
Mean: 2.824881061
Median Absolute Deviation (MAD): 1
Skewness: 2.845981594
Sum: 27907
Variance: 4.730617963

Histogram with fixed size bins (bins=10)

Common values

Value                 Count   Frequency (%)
2                     2357    23.9%
3                     2116    21.4%
1                     1790    18.1%
4                     1413    14.3%
5                     746     7.6%
0                     745     7.5%
6                     345     3.5%
7                     163     1.6%
8                     68      0.7%
9                     22      0.2%
Other values (17)     114     1.2%

Minimum 5 values

Value                 Count   Frequency (%)
0                     745     7.5%
1                     1790    18.1%
2                     2357    23.9%
3                     2116    21.4%
4                     1413    14.3%

Maximum 5 values

Value                 Count   Frequency (%)
27                    1       < 0.1%
25                    1       < 0.1%
24                    1       < 0.1%
23                    1       < 0.1%
22                    2       < 0.1%
 
blueFirstBlood
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 77.2 KiB

Value                 Count   Frequency (%)
Yes                   4987    50.5%
No                    4892    49.5%

blueKills
Real number (ℝ≥0)

Distinct count: 21
Unique (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.18392549853224
Minimum: 0
Maximum: 22
Zeros: 63
Zeros (%): 0.6%
Memory size: 77.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 4
median: 6
Q3: 8
95-th percentile: 12
Maximum: 22
Range: 22
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 3.011027975
Coefficient of variation (CV): 0.4869120716
Kurtosis: 0.2637881975
Mean: 6.183925499
Median Absolute Deviation (MAD): 2
Skewness: 0.5385175399
Sum: 61091
Variance: 9.066289468

Histogram with fixed size bins (bins=10)

Common values

Value                 Count   Frequency (%)
6                     1322    13.4%
5                     1302    13.2%
4                     1186    12.0%
7                     1138    11.5%
8                     942     9.5%
3                     917     9.3%
9                     717     7.3%
2                     609     6.2%
10                    527     5.3%
11                    340     3.4%
Other values (11)     879     8.9%

Minimum 5 values

Value                 Count   Frequency (%)
0                     63      0.6%
1                     313     3.2%
2                     609     6.2%
3                     917     9.3%
4                     1186    12.0%

Maximum 5 values

Value                 Count   Frequency (%)
22                    1       < 0.1%
19                    2       < 0.1%
18                    4       < 0.1%
17                    13      0.1%
16                    30      0.3%

blueDeaths
Real number (ℝ≥0)

Distinct count: 21
Unique (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.137665755643284
Minimum: 0
Maximum: 22
Zeros: 72
Zeros (%): 0.7%
Memory size: 77.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 4
median: 6
Q3: 8
95-th percentile: 11
Maximum: 22
Range: 22
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 2.933817709
Coefficient of variation (CV): 0.4780021959
Kurtosis: 0.2140976082
Mean: 6.137665756
Median Absolute Deviation (MAD): 2
Skewness: 0.5074928208
Sum: 60634
Variance: 8.607286351

Histogram with fixed size bins (bins=10)

Common values

Value                 Count   Frequency (%)
5                     1341    13.6%
6                     1293    13.1%
4                     1221    12.4%
7                     1188    12.0%
8                     942     9.5%
3                     934     9.5%
9                     734     7.4%
2                     603     6.1%
10                    494     5.0%
11                    331     3.4%
Other values (11)     798     8.1%

Minimum 5 values

Value                 Count   Frequency (%)
0                     72      0.7%
1                     270     2.7%
2                     603     6.1%
3                     934     9.5%
4                     1221    12.4%

Maximum 5 values

Value                 Count   Frequency (%)
22                    1       < 0.1%
19                    2       < 0.1%
18                    2       < 0.1%
17                    8       0.1%
16                    20      0.2%

blueAssists
Real number (ℝ≥0)

ZEROS

Distinct count: 30
Unique (%): 0.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.645105779937241
Minimum: 0
Maximum: 29
Zeros: 217
Zeros (%): 2.2%
Memory size: 77.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 4
median: 6
Q3: 9
95-th percentile: 14
Maximum: 29
Range: 29
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 4.0645199
Coefficient of variation (CV): 0.6116561623
Kurtosis: 1.159114492
Mean: 6.64510578
Median Absolute Deviation (MAD): 3
Skewness: 0.8902611921
Sum: 65647
Variance: 16.52032202

Histogram with fixed size bins (bins=10)

Common values

Value                 Count   Frequency (%)
5                     1068    10.8%
4                     1010    10.2%
6                     935     9.5%
3                     926     9.4%
7                     880     8.9%
8                     843     8.5%
2                     731     7.4%
9                     648     6.6%
10                    541     5.5%
1                     468     4.7%
Other values (20)     1829    18.5%

Minimum 5 values

Value                 Count   Frequency (%)
0                     217     2.2%
1                     468     4.7%
2                     731     7.4%
3                     926     9.4%
4                     1010    10.2%

Maximum 5 values

Value                 Count   Frequency (%)
29                    2       < 0.1%
28                    1       < 0.1%
27                    1       < 0.1%
26                    3       < 0.1%
25                    7       0.1%
 
blueEliteMonsters
Categorical

Distinct count: 3
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 77.2 KiB

Value                 Count   Frequency (%)
0                     5156    52.2%
1                     4013    40.6%
2                     710     7.2%

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

blueDragons
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 77.2 KiB

Value                 Count   Frequency (%)
0                     6303    63.8%
1                     3576    36.2%

blueHeralds
Boolean

Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 77.2 KiB

Value                 Count   Frequency (%)
0                     8022    81.2%
1                     1857    18.8%

blueTowersDestroyed
Real number (ℝ≥0)

ZEROS

Distinct count: 5
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.05142220872557951
Minimum: 0
Maximum: 4
Zeros: 9415
Zeros (%): 95.3%
Memory size: 77.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 0
95-th percentile: 0
Maximum: 4
Range: 4
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.2443691686
Coefficient of variation (CV): 4.752210662
Kurtosis: 39.85959574
Mean: 0.05142220873
Median Absolute Deviation (MAD): 0
Skewness: 5.590241464
Sum: 508
Variance: 0.05971629054

Histogram with fixed size bins (bins=10)

Common values

Value                 Count   Frequency (%)
0                     9415    95.3%
1                     429     4.3%
2                     27      0.3%
3                     7       0.1%
4                     1       < 0.1%

Minimum 5 values

Value                 Count   Frequency (%)
0                     9415    95.3%
1                     429     4.3%
2                     27      0.3%
3                     7       0.1%
4                     1       < 0.1%

Maximum 5 values

Value                 Count   Frequency (%)
4                     1       < 0.1%
3                     7       0.1%
2                     27      0.3%
1                     429     4.3%
0                     9415    95.3%
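
Because blueTowersDestroyed takes only five distinct values, its summary figures can be cross-checked from the frequency table alone. The following is a minimal sketch, not the profiler's own code; it assumes the reported variance is the sample variance (n - 1 denominator) and that CV is the standard deviation divided by the mean. The variable names are illustrative.

```python
import math

# Value -> count, copied from the blueTowersDestroyed frequency table.
freq = {0: 9415, 1: 429, 2: 27, 3: 7, 4: 1}

n = sum(freq.values())                          # number of observations
total = sum(v * c for v, c in freq.items())     # the "Sum" statistic
mean = total / n

# Assumption: sample variance with an (n - 1) denominator.
variance = sum(c * (v - mean) ** 2 for v, c in freq.items()) / (n - 1)
std = math.sqrt(variance)
cv = std / mean                                 # coefficient of variation
zeros_pct = 100 * freq[0] / n

print(n, total)
print(round(mean, 6), round(variance, 6), round(std, 6), round(cv, 4))
print(round(zeros_pct, 1))
```

Under these assumptions the counts reproduce the reported number of observations (9879) and sum (508), and the variance and CV agree with the report to the printed precision.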
 

blueTotalGold
Categorical

HIGH CORRELATION

Distinct count: 5
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 77.2 KiB

Value                 Count   Frequency (%)
Normal                5206    52.7%
Few                   3632    36.8%
Many                  919     9.3%
Very Few              69      0.7%
Very Many             53      0.5%

Length

Max length: 9
Median length: 6
Mean length: 4.74106691
Min length: 3

blueAvgLevel
Real number (ℝ≥0)

Distinct count: 17
Unique (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6.916003644093531
Minimum: 4.6
Maximum: 8.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 77.2 KiB

Quantile statistics

Minimum: 4.6
5-th percentile: 6.4
Q1: 6.8
median: 7
Q3: 7.2
95-th percentile: 7.4
Maximum: 8
Range: 3.4
Interquartile range (IQR): 0.4

Descriptive statistics

Standard deviation: 0.3051458223
Coefficient of variation (CV): 0.04412169773
Kurtosis: 1.116166722
Mean: 6.916003644
Median Absolute Deviation (MAD): 0.2
Skewness: -0.3385015794
Sum: 68323.2
Variance: 0.09311397286

Histogram with fixed size bins (bins=10)

Common values

Value                 Count   Frequency (%)
7                     2611    26.4%
6.8                   2442    24.7%
7.2                   1779    18.0%
6.6                   1339    13.6%
7.4                   684     6.9%
6.4                   578     5.9%
6.2                   175     1.8%
7.6                   174     1.8%
6                     43      0.4%
7.8                   28      0.3%
Other values (7)      26      0.3%

Minimum 5 values

Value                 Count   Frequency (%)
4.6                   1       < 0.1%
4.8                   1       < 0.1%
5.2                   2       < 0.1%
5.4                   3       < 0.1%
5.6                   4       < 0.1%

Maximum 5 values

Value                 Count   Frequency (%)
8                     2       < 0.1%
7.8                   28      0.3%
7.6                   174     1.8%
7.4                   684     6.9%
7.2                   1779    18.0%
 
blueTotalExperience
Categorical

Distinct count: 3
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 77.2 KiB

Value                 Count   Frequency (%)
Normal                5658    57.3%
High                  4192    42.4%
Low                   29      0.3%

Length

Max length: 6
Median length: 6
Mean length: 5.142524547
Min length: 3

blueTotalMinionsKilled
Categorical

HIGH CORRELATION

Distinct count: 3
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 77.2 KiB

Value                 Count   Frequency (%)
Normal                5047    51.1%
High                  4784    48.4%
Low                   48      0.5%

Length

Max length: 6
Median length: 6
Mean length: 5.016904545
Min length: 3

blueTotalJungleMinionsKilled
Categorical

Distinct count: 3
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 77.2 KiB

Value                 Count   Frequency (%)
Normal                8406    85.1%
High                  1307    13.2%
Low                   166     1.7%

Length

Max length: 6
Median length: 6
Mean length: 5.684988359
Min length: 3

blueGoldDiff
Real number (ℝ)

Distinct count: 6047
Unique (%): 61.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14.414110739953436
Minimum: -10830
Maximum: 11467
Zeros: 2
Zeros (%): < 0.1%
Memory size: 77.2 KiB

Quantile statistics

Minimum: -10830
5-th percentile: -4033.2
Q1: -1585.5
median: 14
Q3: 1596
95-th percentile: 4074
Maximum: 11467
Range: 22297
Interquartile range (IQR): 3181.5

Descriptive statistics

Standard deviation: 2453.349179
Coefficient of variation (CV): 170.2046851
Kurtosis: 0.2994089
Mean: 14.41411074
Median Absolute Deviation (MAD): 1592
Skewness: 0.03003750876
Sum: 142397
Variance: 6018922.196

Histogram with fixed size bins (bins=10)

Common values

Value                 Count   Frequency (%)
428                   8       0.1%
1167                  7       0.1%
-1806                 7       0.1%
-1208                 6       0.1%
-355                  6       0.1%
-211                  6       0.1%
-2564                 6       0.1%
-418                  6       0.1%
-978                  6       0.1%
-259                  6       0.1%
Other values (6037)   9815    99.4%

Minimum 5 values

Value                 Count   Frequency (%)
-10830                1       < 0.1%
-10329                1       < 0.1%
-9341                 1       < 0.1%
-9152                 1       < 0.1%
-8472                 1       < 0.1%

Maximum 5 values

Value                 Count   Frequency (%)
11467                 1       < 0.1%
8977                  1       < 0.1%
8863                  1       < 0.1%
8776                  1       < 0.1%
8667                  1       < 0.1%

blueExperienceDiff
Real number (ℝ)

Distinct count: 5356
Unique (%): 54.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: -33.62030569895738
Minimum: -9333
Maximum: 8348
Zeros: 1
Zeros (%): < 0.1%
Memory size: 77.2 KiB

Quantile statistics

Minimum: -9333
5-th percentile: -3206.1
Q1: -1290.5
median: -28
Q3: 1212
95-th percentile: 3109.3
Maximum: 8348
Range: 17681
Interquartile range (IQR): 2502.5

Descriptive statistics

Standard deviation: 1920.370438
Coefficient of variation (CV): -57.11936279
Kurtosis: 0.3648478761
Mean: -33.6203057
Median Absolute Deviation (MAD): 1252
Skewness: 0.02287603635
Sum: -332135
Variance: 3687822.62

Histogram with fixed size bins (bins=10)

Common values

Value                 Count   Frequency (%)
63                    8       0.1%
-29                   7       0.1%
-1025                 7       0.1%
-226                  7       0.1%
-298                  7       0.1%
411                   7       0.1%
-1476                 7       0.1%
-1142                 6       0.1%
241                   6       0.1%
1938                  6       0.1%
Other values (5346)   9811    99.3%

Minimum 5 values

Value                 Count   Frequency (%)
-9333                 1       < 0.1%
-8531                 1       < 0.1%
-8290                 1       < 0.1%
-8242                 1       < 0.1%
-7340                 1       < 0.1%

Maximum 5 values

Value                 Count   Frequency (%)
8348                  1       < 0.1%
8265                  1       < 0.1%
7645                  1       < 0.1%
7621                  1       < 0.1%
7609                  1       < 0.1%

blueCSPerMin
Categorical

HIGH CORRELATION

Distinct count: 3
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 77.2 KiB

Value                 Count   Frequency (%)
Normal                5047    51.1%
High                  4784    48.4%
Low                   48      0.5%

Length

Max length: 6
Median length: 6
Mean length: 5.016904545
Min length: 3

blueGoldPerMin
Categorical

HIGH CORRELATION

Distinct count: 5
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 77.2 KiB

Value                 Count   Frequency (%)
Normal                5206    52.7%
Few                   3632    36.8%
Many                  919     9.3%
Very Few              69      0.7%
Very Many             53      0.5%

Length

Max length: 9
Median length: 6
Mean length: 4.74106691
Min length: 3

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, implying that for a linear function the steepness of the line does not affect r; only the sign of the slope does.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
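
The calculation described above can be sketched in plain Python. This is an illustrative implementation, not the code the profiler runs; the function name `pearson_r` and the sample values are made up for the example.

```python
import math

def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/(n-1) factors of the covariance and the two standard
    # deviations cancel, so plain sums are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical data: shifting and rescaling y leaves r unchanged.
xs = [1.0, 2.0, 4.0, 7.0]
ys = [2.0, 3.0, 9.0, 12.0]
r_original = pearson_r(xs, ys)
r_rescaled = pearson_r(xs, [3 * y + 5 for y in ys])
print(abs(r_original - r_rescaled) < 1e-12)
```

The second call illustrates the invariance under changes in location and positive scale mentioned above.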

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
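
As a rough sketch of that recipe, the snippet below ranks both variables and then applies the Pearson formula to the ranks. Giving tied values the average of their positions is an assumption of this example (it is the usual convention, but the report does not state it); the function names are illustrative.

```python
import math

def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # average position of the tied group
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def spearman_rho(xs, ys):
    """Pearson correlation computed on the rank variables of xs and ys."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)

# A nonlinear but strictly increasing relationship gives rho = 1.
xs = [1, 2, 3, 4, 5]
print(spearman_rho(xs, [x ** 3 for x in xs]))
```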

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
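
The pair-counting definition can be written down directly. The sketch below follows the wording above literally, which corresponds to the variant usually called tau-a; library implementations often default to a tie-adjusted variant (tau-b), so results can differ when the data contain ties. The function name is illustrative.

```python
from itertools import combinations

def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1      # both variables move the same way
        elif direction < 0:
            discordant += 1      # the variables move in opposite ways
        # direction == 0 means a tie in x or y; it counts for neither
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Two concordant pairs and one discordant pair out of three.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```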

Phik (φk)

Phik (φk) is a recently proposed correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for it.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
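
A minimal sketch of a bias-corrected Cramér's V, following the commonly cited form of Bergsma's 2013 correction, is given below. It is an illustration rather than the profiler's own code. The contingency table in the usage example is an assumption: it supposes that the identical class counts of blueCSPerMin and blueTotalMinionsKilled (Normal 5047, High 4784, Low 48) come from rows that always carry the same label in both columns, which the report's high-correlation warning suggests but does not prove.

```python
import math

def cramers_v_corrected(table):
    """Bias-corrected Cramér's V for a contingency table given as a list of rows."""
    r, k = len(table), len(table[0])
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(r)) for j in range(k)]
    n = sum(row_totals)

    # Pearson chi-squared statistic against the independence expectation.
    chi2 = 0.0
    for i in range(r):
        for j in range(k):
            expected = row_totals[i] * col_totals[j] / n
            if expected > 0:
                chi2 += (table[i][j] - expected) ** 2 / expected

    phi2 = chi2 / n
    # Bias correction: shrink phi^2 and the effective table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(k_corr - 1, r_corr - 1)
    if denom <= 0:
        return 0.0
    return math.sqrt(phi2_corr / denom)

# Assumed cross-tabulation: every row has the same label in both columns.
assumed_table = [
    [5047, 0, 0],   # Normal
    [0, 4784, 0],   # High
    [0, 0, 48],     # Low
]
v_assumed = cramers_v_corrected(assumed_table)
print(round(v_assumed, 6))
```

Under that assumption the two variables are perfectly associated and the coefficient comes out at 1 (up to floating-point error); a table whose cells all match the independence expectation gives 0.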

Missing values

Sample

First rows

gameIdblueWinsblueWardsPlacedblueWardsDestroyedblueFirstBloodblueKillsblueDeathsblueAssistsblueEliteMonstersblueDragonsblueHeraldsblueTowersDestroyedblueTotalGoldblueAvgLevelblueTotalExperienceblueTotalMinionsKilledblueTotalJungleMinionsKilledblueGoldDiffblueExperienceDiffblueCSPerMinblueGoldPerMin
04519157822No282Yes96110000Normal6.6NormalNormalNormal643-8NormalNormal
14523371949No121No5550000Few6.6NormalNormalNormal-2908-1173NormalFew
24521474530No150No71141100Normal6.4NormalNormalNormal-1172-1033NormalNormal
34524384067No431No4551010Few7.0NormalNormalNormal-1321-7NormalFew
44436033771No754No6660000Normal7.0HighNormalNormal-1004230NormalNormal
54475365709Yes180No5361100Few7.0NormalHighNormal698101HighFew
64493010632Yes183Yes7671100Normal6.8NormalHighNormal24111563HighNormal
74496759358No162No51330000Few6.4NormalNormalNormal-2615-800NormalFew
84443048030No163No7780000Normal7.2HighNormalNormal-1979-771NormalNormal
94509433346Yes131Yes4551100Few6.8NormalHighNormal-1548-1574HighFew

Last rows

gameIdblueWinsblueWardsPlacedblueWardsDestroyedblueFirstBloodblueKillsblueDeathsblueAssistsblueEliteMonstersblueDragonsblueHeraldsblueTowersDestroyedblueTotalGoldblueAvgLevelblueTotalExperienceblueTotalMinionsKilledblueTotalJungleMinionsKilledblueGoldDiffblueExperienceDiffblueCSPerMinblueGoldPerMin
98694527875317No121No912121100Normal7.0HighNormalNormal-2121-1038NormalNormal
98704527811425Yes462Yes5320000Normal7.2HighHighHigh19741712HighNormal
98714527715781No122No4552110Few6.8HighNormalNormal-727343NormalFew
98724527650398Yes120Yes7790000Normal7.0NormalHighNormal7561HighNormal
98734527878058Yes182Yes126130000Many7.2HighNormalNormal26392364NormalMany
98744527873286Yes172Yes7451100Normal7.2HighNormalHigh25192469NormalNormal
98754527797466Yes540No6481100Normal7.2HighHighNormal782888HighNormal
98764527713716No231No6750000Few7.0NormalNormalNormal-2416-1877NormalFew
98774527628313No144Yes2331100Few6.6NormalHighNormal-839-1085HighFew
98784523772935Yes180Yes6650000Normal7.0NormalNormalNormal927-58NormalNormal